Near-Synonym Choice in an Intelligent Thesaurus
نویسنده
چکیده
An intelligent thesaurus assists a writer with alternative choices of words and orders them by their suitability in the writing context. In this paper we focus on methods for automatically choosing nearsynonyms by their semantic coherence with the context. Our statistical method uses the Web as a corpus to compute mutual information scores. Evaluation experiments show that this method performs better than a previous method on the same task. We also propose and evaluate two more methods, one that uses anticollocations, and one that uses supervised learning. To asses the difficulty of the task, we present results obtained by human judges.
منابع مشابه
Near-Synonym Choice using a 5-gram Language Model
In this work, an unsupervised statistical method for automatic choice of near-synonyms is presented and compared to the stateof-the-art. We use a 5-gram language model built from the Google Web 1T data set. The proposed method works automatically, does not require any human-annotated knowledge resources (e.g., ontologies) and can be applied to different languages. Our evaluation experiments sho...
متن کاملV-l Design Criteria for Automatic Information Systems
s (150 words) Titles Only Abstracts Null Summaries Nulls Null Summaries Null Synonym Recognition (Thesaurus) only Word Stem Match Synonym Recognition Statistical Phrase Recognition Simple Word Stems Synonym Recognition Index Term Match Abstracts with Thesarus » "much greater than > "greater than" ~ "about equal to" (based on experiments with 4 collections in 3 topic areas) Overall Evaluation Re...
متن کاملNear-synonym choice in natural language generation
We present Xenon, a natural language generation system capable of distinguishing between nearsynonyms. It integrates a near-synonym choice module with an existing sentence realization module. We evaluate Xenon using English and French nearsynonyms.
متن کاملUnsupervised Phrasal Near-Synonym Generation from Text Corpora
Unsupervised discovery of synonymous phrases is useful in a variety of tasks ranging from text mining and search engines to semantic analysis and machine translation. This paper presents an unsupervised corpus-based conditional model: Near-Synonym System (NeSS) for finding phrasal synonyms and near synonyms that requires only a large monolingual corpus. The method is based on maximizing informa...
متن کاملSemantic Representations of Near-Synonyms for Automatic Lexical Choice
Semantic Representations of Near-Synonyms for Automatic Lexical Choice Philip Edmonds Doctor of Philosophy Graduate Department of Computer Science University of Toronto 1999 We develop a new computational model for representing the fine-grained meanings of nearsynonyms and the differences between them. We also develop a sophisticated lexical-choice process that can decide which of several near-...
متن کامل